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Confidence  Bands  Under  Proportional  Hazards1 

by 


Myles  Hollander  and  Edsel  Pena 
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Asymptotic  simultaneous  confidence  bands  are  derived  for  the  survival 
function  under  the  proportional  hazards  model  of  random  right -censorship. 

These  bands  are  based  on  the  maximum  likelihood  estimator  t(MLE)- of  the  survival 

function,  rather  than  the  well-known  product  limit  estimator  K^hE) .’  In  the 

hi-'- 


case  where  the  censoring  parameter,  denoted  by  fi,  is  known  the  bands  are 

Uv  r 

asymptotically  exact,  while  when  fe  is  unknown  the  bands  are  asymptotically 

b1**- 

conservative .  For  the  case  where  p  is  unknown,  the  proposed  bands  are  shown 
to  be  narrower  than  those  proposed  by  Cheng  and  Chang  (1985) .  Csorgo  and 


Horvath’s  (1986)  idea  of  mixing  bands  is  then  employed  to  obtain  even  narrower 
bands.  As  one  would  expect,  under  the  more  structured  model,  the  PLE-based 
band  of  Gillespie  and  Fisher  (1979)  is  shown  to  be  inferior  to  the  MLE-based 
bands,  and  this  inferiority  is  more  marked  as  the  degree  of  censoring  increases. 
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1.  Introduction  and  summary.  Let  (X^Y^),  i»l,...,n,  be  independent  and  iden¬ 
tically  distributed  random  vectors.  In  the  proportional  hazards  model  of  random 
right-censorship,  referred  to  in  the  sequel  as  the  Koziol-Green  (KG)  model,  X.^ 
and  are  independent,  the  X^'s  denote  the  survival  times  with  continuous 
survival  function  F(t)=P(X>t),  and  the  Y^'s  denote  the  censoring  times  with 
continuous  censoring  function  G(t)=P(Y>t) .  Furthermore,  this  model  asserts  that 
for  some  g*0,  possibly  unknown,  G(t)=F(t)  for  all  t^O.  The  censored  data  consist 
of  (Z^,6^),  i=l,...,n,  where  Z^=minimum(X.^) ,  6^=1  (X^<Y^)  and  1(A)  is  the  indi¬ 
cator  of  event  A. 

We  develop  asymptotic  (l-a)100%  simultaneous  confidence  bands  for  F  under 
the  KG  model.  The  bands  are  based  on  the  maximum  likelihood  estimator  (MLE)  of 
F  for  this  model.  These  bands,  which  are  derived  in  Section  2,  are  presented  in 
pairs  with  the  bands  in  each  pair  being  asymptotically  equivalent.  The  bands  in 
each  pair  are  of  the  forms 

(1.1)  ([Fn(t){l  +  rn(t;Or\  Fn(t){l  -  rn(t;»)>_1],  0<t*T> 
and 

(1.2)  {[Fn(t){l  -  rn(t;-)>,  Fn(t){l  +  rn(t;.)>],  OstsT), 

where  Fn(t)  is  the  MLE  of  F  given  in  (2.1),  and  rn(t;*)  depends  on  at  least  one 
tabular  value,  with  the  tabular  values  depending  on  g,  o,  and  H(T),  where  H=FG. 


In  the  case  where  g  is  known  the  bands  are  asymptotically  exact,  while  when  g  is 
unknown  the  bands  are  asymptotically  conservative,  though  less  conservative  than 
bands  proposed  by  Cheng  and  Chang  (1985)  under  the  same  model.  Csfirgo  and 
Horvath's  (1986)  idea  of  mixing  bands  is  then  employed  to  obtain  narrower  mixed 
bands.  In  Section  3  we  assess  the  adequacy  of  the  asymptotic  results  for  finite 
sample  sizes.  For  different  values  of  n,  g,  and  a,  estimates  of  the  achieved 
confidence  levels  for  each  of  the  bands  are  obtained  through  a  computer  simulation. 
In  Section  4  the  asymptotic  widths  of  the  bands  are  compared  analytically.  It  is 


shown  that  when  g  is  unknown,  some  of  the  bands  presented  here  are  narrower  than 


;s 


□  □ 


those  of  Cheng  and  Chang  (1985).  Furthermore,  when  B  is  large  and  the  KG  model 
holds,  the  MLE-based  mixed  bands  outperform  the  Gillespie  and  Fisher  (1979) 
band  based  on  the  Kaplan  and  Meier  (1958)  product-limit  estimator  (PLE)  of  F. 

The  KG  model  has  received  considerable  attention  in  the  literature.  This 
model  was  introduced  by  Koziol  and  Green  (1976)  in  the  context  of  developing  a 
PLE-based  goodness-of-fit  test  for  censored  data.  Earlier,  Efron  (1967)  used  a 
special  case  of  this  model  to  compare  the  efficiencies  of  various  two-sample 
tests  for  censored  data.  Under  this  model,  CsorgS  and  Horvath  (1981)  presented 
a  PLE-based  confidence  band  and  a  goodness-of-fit  test  for  F.  Their  goodness-of- 
fit  test  improved  upon  that  of  Koziol  and  Green  (1976) in  that  their  test  does 
not  require  that  8  be  known.  Also,  under  the  assumption  that  the  KG  model  holds, 
Chen,  Hollander  and  Langberg  (1982)  and  Wellner  (1985)  obtained  the  exact  moments 
of  versions  of  the  PLE,  and  studied  the  applicability  of  using  the  asymptotic 
variance  of  the  PLE  in  place  of  its  exact  variance  for  finite  samples.  Emoto 
(1984)  also  utilized  a  special  case  of  *he  KG  model  in  comparing  the  mean-square 
errors  of  the  PLE  and  the  MLE  of  the  survival  function. 

The  importance  of  the  KG  model  in  reliability  theory  and  survival  analysis 
should  not  be  underestimated.  The  model  arises  naturally  in  reliability  studies, 
and  in  particular,  in  competing  risks  models  (cf.  Example  1  of  Chen,  Hollander 
and  Langberg,  1982).  More  importantly,  its  great  tractability  enables  the  study 
of  the  performances  and  properties  of  other  methods  developed  for  more  general 
models  (such  as  the  random  right-censorship  model  where  no  structural  relation¬ 
ship  between  F  and  G  is  assumed)  in  settings  where  these  methods  are  not  optimal. 
In  this  spirit,  Chen,  Hollander  and  Langberg  (1982)  and  Wellner  (1985)  were  able 
to  study  how  well  the  asymptotic  variance  of  the  PLE  approximates  the  exact 
variance  when  the  KG  model  holds.  In  a  similar  vein,  we  are  able  to  study  the 
Gillespie  and  Fisher  (1979)  PLE-based  band  under  the  KG  model  where  it  has  the 


correct  asymptotic  coverage  probability  but  it  is  not  the  preferred  band.  In  many 
ways,  the  KG  model  assumes  a  position  in  reliability  models  similar  to  the 
role  that  its  precursor,  the  Lehmann  alternatives,  has  in  classical  nonparametric 
settings  such  as  the  one-  and  two-sample  problems.  Lehmann’s  (1953)  alternatives 
also  arise  naturally  in  certain  situations,  and  therefore  the  tests  derived  for 
these  alternatives  are  of  importance  in  their  own  right.  However  such  tests 
serve  also  as  standards  for  assessing  how  much  power  other  competing  tests  (such 
as  the  Wilcoxon  or  normal  scores  tests)  sacrifice  in  models  for  which  they  are 
not  optimal.  The  KG  model  and  procedures  derived  under  it  fulfill  a  similar 
role  in  the  domain  of  reliability  models. 


2.  Confidence  bands  for  the  survival  function.  The  maximum  likelihood  estimator 


of  F  under  the  KG  model  is  given  by 

H  (t)Y  if  6  is  known 

(2.1)  F  (t)  =  "  ,, 

H  (t)YtnJ  if  0  is  unknown, 
nv 


where  y=(1+0)-1,  Yn=Y(n)=n-1£”=1  6^^  and  Hn(t)=n~A  I(Z^>t) .  The  parameter  y 
equals  the  probability  of  an  uncensored  observation,  so  6  is  referred  to  in  the 
sequel  as  the  censoring  parameter.  For  ease  of  reference,  the  different  pairs 
of  bands  presented  below  are  labelled  by  Bn,  ...»  Hn,  with  the  bands  in  each 
pair  differentiated  by  a  superscript  of  1  or  2,  e.g.,  A*,  An,  etc.,  according 
to  whether  it  is  of  the  form  (1.1)  or  (1.2),  respectively.  In  (1.1)  and  (1.2) 
it  will  be  implicit  that  Fn(t)=Hn(t)Y  or  Hn(t)Y^  according  to  whether  0  is 
known  or  unknown,  respectively.  Furthermore,  to  facilitate  the  proofs  of  some 
results,  we  assume  the  existence  of  a  basic  probability  space  (fl,F,P)  such 
that  the  random  functions  discussed  below  are  measurable  maps  from  (ft,F)  to 
(D,P),  where  the  latter  is  Skorohod's  measurable  space  on  [0,T]  (cf.  Billingsley, 
1968 ,  Chapter  3) . 


■lrn 


■  1  rn 


We  first  present  the  bands  for  the  case  where  0  is  known.  With  minor 
modifications  of  Theorem  3  of  Cheng  and  Lin  (1984),  the  following  weak  convergence 


result  for  Fn(t)=Hn(t)Y  is  obtained. 

Theorem  2.1:  If  0<T<»  with  F(T)>0,  then  Wn(t)=n1/2[Fn(t)-F(t)]  =*  W(t)  on  D[0,T], 
where  W(t)  is  a  Gaussian  process  having  zero  mean  and  covariance  function 
Cov(W(s) ,W(t))=u(s)v(t)  for  Oss^tsT,  u(s)=yG(s)_1 [l-H(s)] ,  v(t)=yG(t) _1H(t) , 
and  "*=>"  denotes  "converges  weakly". 


Let  d_1(s)=inf{t:  d(t)£s)  be  the  inverse  of  d(t)=u(t)/v(t)=[l-H(t) ]/H(t) . 
Since  F  and  G  are  continuous  then  d”*(s)  is  strictly  increasing.  By  Theorem  2.1 
and  the  transformation  of  Doob  (1949)  the  following  corollary  is  immediate. 

Corollary  2.1;  If  0<T«*>  with  F(T)>0,  then  Wn(t)/[yF(t)  ]  =*•  B(d(t))  on  D[0,T], 
where  B(t)  is  the  standard  Brownian  motion  process. 


Corollary  2.2:  If  0<T<»  with  F(T)>0,  then  Wn(t)/[YFn(t)]  B(d(t))  on  D[0,T]. 


Notation:  In  the  remainder  of  the  paper,  supt  will  mean  suP0i;t<jT  and  limn 
mean  lira ,  except  when  otherwise  stated. 


will 


Proof  of  Corollary  2.2:  By  Theorem  4.1  of  Billingsley  (1968,  p.  25)  it  suffices 
to  show  that  supt|  (Wn(t)/Fn(t) }  -  {WR(t)/F(t)  )|  -*■  0  almost  surely  (a.s.).  By 
the  continuous  mapping  theorem  (Billingsley,  1968,  Theorem  5.1),  supt|Wn(t)|  -*■ 
supt|w(t)|  in  distribution.  Since  P{supt|W(t) | <®}=1,  it  therefore  remains  to 
show  that  supt|  (1/Fn(t) }  -  (1/F(t)}|  -*■  0  a.s. 

Let  e=F(T)>0  and  take  uigcGgsCuiefi:  supt|Hn(t)  -  H(t)  |  -*■  0).  Then  there  exists 
an  integer  N=N(u)q,e)  such  that  Hn(t) £(e/2) for  all  n£N.  Thus  for  this  u)Q  and 
n^N,  supt|{l/Fn(t)>  -  {l/F(t)}|s2E"2supt|Fn(t)  -  F(t)|=2E'2supt|Hn(t)Y-H(t)Yj  +  0. 
Since  u>q  is  arbitrary  and  P(Gq)=1  by  the  uniform  strong  consistency  of  Hn(t), 
the  proof  is  complete.  j| 


4 


Let  Qj(Xj,X2)  denote  the  probability  that  the  process  B(t)  lies  between 
the  lines  -(Xj  +  X2t)  and  (X^  +  X2t)  for  all  Ost^T.  From  Anderson  (1960),  we 
find  that  for  X1>0, 

QT(X1,X2)=P{|B(t) |SXX  +  X2t,  OstST) 

(2.2)  =l-2$[(-X2T-X1)/T1/2]-2  ^“=1exp(-2X1X2k2)« 

{* [ (X2T+2X1k+X1) /T1/2]  -  «[(-X2T+2X1k-X1)/T1/2]}, 

where  $(•)  is  the  cumulative  distribution  function  of  a  standard  normal  variable. 
Also,  denote  the  empirical  version  of  d(t)  by  dn(t)=[l-Hn(t)]/Hn(t) .  Then  the 
following  theorem  provides  the  An(t;X^,X2)  bands. 

Theorem  2.2:  Let  0<a<l  and  0<T<«  with  F(T)>0.  Then  the  bands  in  (1.1)  and 

(1.2)  with  rn(t;Xj,X2)=n_1^2Y[X^ + X2dn(t)]  are  asymptotic  (l-a)100%  simultaneous 

g  g 

confidence  bands  for  F  on  [0,T]  whenever  (X1,X2)=l-a. 

Proof:  By  the  choice  of  X*  and  X*,  and  Corollary  2.1, 

l-a=P{|B(t)|<X*  +  X2t,  OStSd(T)} 

=P{ | B(d(t) ) | SX^  +  X2d(t),  0st<T> 

(2.3)  =limnP{|Wn(t)/[YF(t)]|sXj  ♦  X*d(t) ,  OStsT). 

A  similar  argument  as  in  Corollary  2.2  shows  that  supt|dn(t)  -  d(t)  |  -*•  0  a.s., 
so  for  arbitrary  e>0,  (2.3)  can  be  bounded  from  below  and  above  by 
limnP{  |Wn(t)/[YF(t)]  |^Xj  +  X2[dn(t)±e],  0st<T).  Since  e  is  arbitrary,  it  follows 
that  l-a=limnP{ |Wn(t)/[yF(t)] |^Xj  +  X2dn(t),  0st<T}.  Inverting  the  set  inside  the 
braces  for  F(t)  yields  the  band  in  (1.1)  with  rn(t;XpX^)  given  in  the  theorem.  A 
similar  argument  with  Corollary  2.2  instead  of  Corollary  2.1  yields  the  band  in 

(1.2).  I! 

Confidence  bands  for  F  are  also  derivable  from  transformations  to  the  Brownian 
bridge  process  B°(t).  In  contrast  to  the  PLE  (cf.  Hall  and  Wellner,  1980),  the 
limiting  process  W(t)  in  Theorem  2.1  can  be  transformed  in  two  ways  to  B°(t). 


Theorem  2.3;  If  0<T<«  with  F(T)>0,  then 

(i)  {B°(L(t))}0stsT=  {W(t)[l-L(t)]/F(t)}0^T 

(ii)  ( B0(l-H(t) )  }Q^T  =  (W(t)H(t) /  [YF(t)  ]  }Q, ^  , 

where  means  equal  in  distribution,  and  L(t)=y2 [l-H(t) ]/{H(t)  +  y2[l-H(t)]}= 
y2d(t)/[l  +  Y2d(t)]. 


Cheng  and  Chang  (1985)  proved  (i),  and  since  the  proof  of  (ii)  is  straight¬ 
forward,  it  is  omitted  here. 


Corollary  2.3:  If  0<T<®  with  F(T)»0,  then 


(i)  Wn(t)[l-L(t)]/F(t) 


BU(L(t) )  on  D[0,T] , 


(ii)  Wn(t)H(t)/[YF(t)] 


Bu(l-H(t))  on  D[0,T] . 


Proof:  The  results  follow  from  Theorem  2.1  and  Theorem  2.3. 


Corollary  2.4:  If  0<T<®  with  F(T)>0,  then 

,0 


(i)  Wn(t)[l-L(t)]/Fn(t) 


(ii)  Wn(t)H(t)/[YFn(t)] 


B  (L(t))  on  D[0,T], 

0, 


s>  B  (l-H(t))  on  D[0,T] . 

The  proof  of  this  corollary  is  analogous  to  that  of  Corollary  2.2  except 
that  we  use  Corollary  2.3  instead  of  Corollary  2.1. 


.0, 


Crossing  probabilities  of  the  Brownian  bridge  process  B  (t)  have  been  well- 
studied  in  the  literature  (cf.  Doob(1949),  Anderson  (1960),  and  Hall  and  Wellner 
(1980)).  Here  we  only  need  to  know  that  for  0<T<1, 

,0, 


(2.4)  P{supt|B  (t)  |SA}=Q1jy£1_Tj(A,A). 

We  now  present  the  Bn(t;Ab)  and  Cn(t;Ac)  pairs  of  bands,  respectively. 


Theorem  2.4:  Let  0<a<l  and  0<T<«  with  F(T)>0.  Then  the  bands  in  (1.1)  and 


(1.2)  with 

(i)  rn(t;Ab)=n‘1/2Ab[l  +  y^OO], 
(ii)  r  (t;AC)=n'1/2ACY/H  (t) 


n 


are  pairs  of  asymptotic  (l-o)100%  simultaneous  confidence  bands  for  F  on  [0,T] 
whenever  l-a=QY2d(T)  (Ab,Ab)*Qdm  (AC,AC) . 

Proof.  Cheng  and  Chang  (1985)  proved  the  result  for  band  (1.2)  with  rn(t;*) 
given  by  (i).  The  proof  for  band  (1.1)  with  rn(t;»)  given  by  (i)  is  similar. 
Below,  we  just  present  the  proof  for  the  bands  with  rn(t;»)  given  by  (ii).  By 
the  choice  of  Ae,  we  have 

1-a=Qd(T)CAC,AC)=Q[i_H(T)]/H(T)(AC,AC) 

=P{suP0Stsl-H(T)lB°(t)lSxC}  (by  2‘4) 

=P{supt | B° (l-H(t) ) |SAC> 

=limnP{supt|Wn(t)H(t)/[YF(t)]|sAC) 

by  Corollary  2.3(ii).  Since  yfl  and  Hn(t)  are  uniformly  strongly  consistent 
estimators  of  y  and  H(t),  the  above  expression  is 

SlimnP{supt|Wn(t)Hn(t)/[ynF(t)]|5AC}. 

Inverting  the  set  inside  the  braces  for  F  yields  the  band  (1.1)  with  rn(t;*)  given 
by  (ii).  A  similar  argument  using  Corollary  2.4(ii)  yields  the  other  band.  || 


In  contrast  with  the  case  where  8  is  known,  it  is  difficult  to  construct 
asymptotically  exact  (l-a)100%  simultaneous  confidence  bands  when  8  is  unknown. 
The  explanation  for  this  is  deferred  until  after  the  statement  of  Theorem  2.5. 
Cheng  and  Lin  (1984)  gave  the  following  representation  for  Fn(t)=Hn(t)Y^ , 
the  MLE  of  F(t)  when  8  is  unknown: 

(2.5)  Zn(t)=n1/2[Fn(t)-F(t)]=Wn(t)+Vn(t)+Rn(t) 

where 

Wn(t) -yG(t) _1n1/2 [Hn(t) -H(t) ] ,  Vn(t) =yF (t) (fnF(t) }n1/2 (yn-y) , 
with  supt|Rn(t) | =0(£n£n(n)/n1^2)  a.s.  This  representation,  in  conjunction  with 
the  functional  and  univariate  central  limit  theorems,  implies  the  following 
weak  convergence  result. 
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Theorem  2.5:  If  0<T«*>  with  F(T)>0,  then  Zn(t)  =*>  Z(t),  where  Z(t)  is  a  Gaussian 
process  having  zero  mean  and  covariance  function  Cov(Z(s) ,Z(t))=u(s)v(t)  ♦ 
w(s)w(t)  for  0<sstsT,  where  u(s)  and  v(t)  are  defined  in  Theorem  2.1,  and 
w(s)=f31/2F(s)£nF(s). 

The  form  of  the  covariance  function  of  the  limiting  process  Z(t)  precludes 
the  possibility  of  transforming  Z  into  the  Brownian  motion  process  and/or  the 
Brownian  bridge  process  B°(t).  This  prevents  us  from  obtaining  exact  asymptotic 
confidence  bands  for  F  through  the  use  of  the  function  (^(A^^)  in  (2.2).  An 
ambitious  program  of  deriving  crossing  probabilities  of  Z(t)  might  be  possible, 
but  the  difficulty  might  far  outweigh  the  benefits  of  such  a  program.  We  shall 
therefore  content  ourselves  with  asymptotically  conservative  bands. 

From  (2.5)  it  is  clear  that  the  limiting  process  can  be  represented  by 
Z(t)=W(t)  +  V(t)  where  W(t)  is  defined  in  Theorem  2.1  and  V(t)  is  of  the  form 
(2.6)  V(t)=(B1/2F(t)£nF(t)}N, 

where  N  is  a  standard  normal  random  variable.  Furthermore,  since  Z i  and  6^  are 
independent  under  the  KG  model  (cf.  Armitage  (1959),  Allen  (1963),  and  Sethuraman 
(1965)),  the  limiting  processes  W(t)  and  V(t)  are  independent. 

Theorem  2.6:  Let  0<a<l  and  0<T<»  with  F(T)>0.  Then  the  bands  (1.1)  and  (1.2) 
with 

(i)  rn(t;Xd,zd)=n_1/,2{Xd[l  +  y^U)]  +  zd&^ 2  |£nFn(t)  |  } ,  and 

(ii)  rn(t;X®,X®,ze)=n-1/2{Yn[X®  ♦  A*dR(t)  ]  +  2  i  £nFn(t)  |  } 

are  pairs  of  conservative  asymptotic  (l-ct)100%  confidence  bands  for  F  on  [0,T] 
whenever  l-a=Q^2£j^T)  CXd,Xd)  [2 <t> ( zd) -1  ]  =Qd (A®, A®)  [2«(ze)-l] ,  where  Bn=  (1-Yn) hn* 

The  resulting  pairs  of  bands  are  respectively  referred  to  as  the  Dn(t;Ad,zd) 
and  En(t;A®,A®,ze)  bands.  Band  D2(t;Ad,zd)  was  introduced  by  Cheng  and  Chang  (1985). 


Below  we  present  only  the  proof  of  band  E^tjA®,^*^6)  since  the  proofs  for  the 
other  bands  are  similar. 

Proof  of  Theorem  2.6:  By  choice  of  A®*  A®  and  z®  and  by  Corollary  2.1, 
l-ct=P{  |  W(t)  /  [YF(t)  ]  |  £A®  +  A®d(t),  Ost<T}*P{  |N|<z®} 

<P{  |W(t)/[TFCt)] !  +  |V(t)/[YF(t)]U[A®  +  A®d(t)  ]  + 
z®|B1/2(B+l)£nF(t)|,  OstsT} 

<P{|Z(t)/[YF(t)]|s[A®  +  A®d(t) ]  +  z®|g1/2(B+l)£nF(t) | ,  0<t<T). 

Using  the  strongly  consistent  estimators  Y  ,  dn(t),  and  Fn(t)  for  Y,  d(t) , 

B  and  F(t),  respectively,  and  by  Theorem  2.5,  the  preceding  probability  is 
=limnP(|Zn(t)/[YnF(t)]|s[A®  +  A®dn(t)]  ♦  ze|6i/2(Bn+l)&iFn(t)|],  0<t<T). 
Inverting  the  set  inside  the  braces  for  F(t)  yields  the  band  E^tjA^A^z  ).  || 

Notice  that  the  first  inequality  in  the  proof  is  a  very  weak  one.  This 
makes  the  Dn-bands  and  En-bands  extremely  conservative.  We  could  have  obtained 
similar  bands  based  on  the  Brownian  bridge  transformation  in  Corollary  2.3(ii) 
and  Corollary  2.4(ii),  but  since  these  bands  are  extremely  conservative  we  do 

£  g 

not  pursue  them.  Instead,  the  Fn(t;A  )  and  Gn(t;A6)  pairs  of  bands  will  now  be 
developed. 

For  A>0  define  the  distribution  functions  (A)  and  Q^2-*  (A)  as 

(2.7)  qW  (A)  =  [c1(T)]-1(2/1r)1/2  /QX  (A-x,A-x)  • 

exp{- (1/2) [x/c1(T)]2}dx 
and 

(2.8)  Qj2)(A)  =  [c2(T)]-1(2/1r)1/2  J0XQd(T)(A-x,A-x)  • 

exp{- (1/2) [x/c2(T)]2)dx, 


where 


g1/2  C(T)  |£nH(T)  j  if  y2+y2-6iH(T)  +  (1-y2)H(T)*0 

c  (T)  = 

1  81/2  r(t0)|£nH(t0)|  if  Y2+Y2-enH(T)  +  (l-Y2)H(T)<0 

2  2  2 

with  tg  being  the  solution  of  the  transcendental  equation  y  +y  &iH(t)  +  (l-Y  )H(t)=0, 

I7(t)«l-L(t),  and 

B1/2H(T)|£nH(T)|  if  H(T)sexp(-l) 
c2Cf)  =  i/2 

6A//exp(-l)  if  H(T) <exp(-l). 

Theorem  2.7:  Let  0<a<l  and  0<T<»  with  F(T)>0.  Then  the  bands  (1.1)  and  (1.2) 
with 

(i)  rn(t;Af)=n"1/2Xf[l+Y2<in(t)],  and 

(ii)  rn(t;Ag)*n-1/2AgYn/Hn(t) 

are  pairs  of  conservative  asymptotic  (l-a)100%  simultaneous  confidence  bands 
for  F  on  [0,T]  whenever  1-ct  *  (Af)  =  Q.£2^(Ag). 

Proof:  We  present  only  the  proof  for  band  (1.1)  with  rn(t;*)  given  in  (i) .  The 
proofs  of  the  others  are  analogous.  By  Theorem  2.5,  we  have 

limnP{Supt|Zn(t)L(t)/F(t) |iJAf}=P{supt|Z(t)L(t)/F(t) |*Af  } 

=P(supt|W(t)L(t)/F(t)  +  V(t)L(t)/F(t)|£Af) 

iP(supt|W(t)L(t)/FCt)|  +  supt|V(t)L(t)/F(t)  |^Af> 

£ 

*/X  P(supjB°(L(t))  |sAf-x}dP{supJV(t)L(t)/F(t)  |*x} 

0  z  z 

by  Theorem  2.3(i)  and  the  Convolution  Theorem.  But  by  (2.6),  (supt| V(t)L(t)/F(t) |^x} 
={  |N|sx/ [supt|  B1^2L(t)£nH(t)  1 1),  and  since  Cj  (T)=supt  |  B1/,2L(t)£nH(t)  | ,  then 

£ 

limnP(supt|Zn(t)L(t)/F(t)|sAf} =/  QL(T)/ [i_L(T) ] (Af-x,A£-x)dP{ |N| Sx/Cj (T) } 

which  by  the  choice  of  A^  and  the  fact  that  L(T)/[l-L(T)]=Y2d(T)  equals  (A^)=l-a. 

2  -1 

Substituting  the  uniformly  strongly  consistent  estimator  Ln(t)=[l+Yndn(t)]  for 
E(t),  and  then  inverting  the  set  {sup^l Zn(t)L(t)/F(t) |sA^}  for  F(t)  yields  the 
band  F*(t;Af) .  || 
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The  pairs  of  bands  in  Theorem  2.7  are  respectively  referred  to  as  the 

.  >  f  N J  f*  t  ..  .  \  Xn  L J  -  *T*  _  ■  .  _  * _  _ a  _  /ir\  •  .  1  1  •  .  *  i 


f  2 

Fn(t;A  )  and  Gn(t;As)  bands.  To  compute  the  constant  c^T)  in  the  distribution 


'(A)  in  (2.7)  a  transcendental  equation  has  to  be  solved.  However,  since  the 


bands  are  asymptotically  conservative,  we  might  lessen  this  conservatism  by 


using  cJ(T)  instead  of  CjCT),  where  the  former  is  defined  as 


C*(T)=  e1/2T(T)|£nH(T)|  if  Y2n2-enH(T)  +  (l-Y2)H(T)^0 
1  [1+Y2(e-1)]-1  if  Y2+Y2-2nH(T)  +  (l-Y2)H(T)<0. 


Note  here  that  since  c^T^c^T)  the  tabular  value  Ar  obtained  using  cJ(T)  will 
be  at  most  the  tabular  value  obtained  using  c^ (T) . 

To  facilitate  the  use  of  the  F  (t;A^)  and  G  (t;A®)  bands,  selected  percentiles 


for  the  distributions  (A)  and  Q,^  (A)  are  given  in  Tables  1  and  2,  respectively. 


In  computing  the  percentiles  of  Q^,  ^ (A)  in  Table  1,  cJ(T)  was  used  in  place  of 
CjCT).  The  IMSL  routine  DCADRE  was  employed  to  perform  the  numerical  integration 


in  (2.7)  and  (2.8).  To  use  Tables  1  and  2,  the  user  needs  to  know  the  values 


of  0  and  F(T) ,  or  at  least  the  values  of  &n  and  Fn(T) ,  the  latter  quantities 


being  the  strongly  consistent  estimators  of  fj  and  F(T),  respectively.  This 


technique  of  substituting  empirical  versions  in  place  of  unknown  quantities 


is  also  required  for  determining  the  tabular  values  of  the  other  bands.  Clearly, 


the  finite  sample  properties  of  the  bands  will  be  affected  by  substituting  these 


empirical  versions,  but  the  asymptotic  properties  will  remain  unchanged. 


By  examining  the  upper  and  lower  contours  of  the  bands  in  each  pair,  one 


notes  that  the  lower  and  upper  contours  of  band  (1.1)  are  always  above  the 


corresponding  contours  of  band  (1.2),  with  the  relationship  for  the  upper 


contours  holding  whenever  l-rn(t;«)>0.  We  could  therefore  adapt  the  idea  of 


Csorgo  and  Horvath  (1986)  of  mixing  the  bands  in  each  pair  to  obtain  a 


narrower  band.  For  example,  we  could  define 


A*(t;A1,A2)=An(t;A1,A2)  n  An(t;A1,A2) 


VL 

'•Wlv, 


as  the  Admixed  band.  One  must  however  be  cautious  in  using  such  mixed  bands 
since  as  pointed  out  in  Csorgo  and  Horvath  (1986) ,  the  finite  sample  coverage 
probability  might  be  nowhere  from  the  nominal  asymptotic  coverage  probability 
of  1-a  if  the  bands  being  mixed  are  appreciably  different.  This  idea  of  mixing 
bands  to  obtain  narrower  ones  is  however  very  appealing  for  the  case  where  B 
is  unknown,  since  the  bands  being  mixed  are  asymptotically  conservative  and 
the  mixed  bands  might  still  turn  out  to  have  finite  sample  coverage  probabili¬ 
ties  of  at  least  1-a. 

Our  comparisons  in  Sections  3  and  4  include  the  PLE-based  GF-type  bands 
(cf .  Gillespie  and  Fisher,  1979  and  Csorgo  and  Horvath,  1986) .  This  pair, 
labelled  the  H^tjAj.A^)  bands,  are  given  by 

(2.9)  H^(t;Aj,Al;)={[Sn(t){l  ♦  ^(tjA^A^) 

Sn(t)(l  -  ^(tjAj.A^  )}_1],  Ost<T> 

and 

(2.10)  H2(t;Ai,A2)={[Sn(t)(l  -  r^tjA^A^)}, 

sn(t)U  +  r^tjAj.A^)}],  OstsT), 
where  r^tjA^A^n'172^  +  ^d*(t)L 

^yd(T) (X1»A2^  = 1_°» 

dn(t)s-^[Sn(u)Hn^]”1<^Sn^  * 

and  S  ft)  is  the  PLE  of  F.  The  PLE  is  defined  as 
n 

Sn(t)="{i:Z.st}[(n-Ri)/(n-Ri*1)1<ill(t<'Daxlsisn2i)> 
where  is  the  rank  of  Z ^  in  the  lexicographic  ordering  of  (Zi,5i),  i=l,...,n, 
and  with  the  convention  that  the  product  over  an  empty  set  equals  1. 

3.  Adequacy  of  the  asymptotic  results  for  finite  samples.  To  assess  the  adequacy 

of  the  asymptotic  results  for  finite  sample  sizes,  a  computer  simulation  was 

2  2 

performed  to  obtain  estimates  of  the  error  probabilities  of  the  bands  A^  ...,  HR. 
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For  each  combination  of  n,  8  and  a,  500  randomly  censored  samples  were  generated 

via  the  KG  model  with  F  exponential  with  scale  parameter  1  and  G  exponential 

with  scale  parameter  6.  For  each  of  these  500  samples,  the  asymptotic  (l-a)100% 

2  2 

simultaneous  confidence  bands  An,  ....  Hn  for  F  on  the  interval  [0,1]  were 
constructed.  The  proportion  among  the  500  bands  that  did  not  contain  F(t)  on 
the  interval  [0,1]  was  then  determined  for  each  of  the  bands  An,  ...,  H^.  The 
values  of  n  were  set  to  50,  100  and  200,  while  the  asymptotic  error  levels  a 
were  set  to  0.01,  0.05  and  0.10.  The  censoring  parameter  8  took  the  values  0.5, 
1.0  and  2.0,  which  amounted  to  33%,  50%  and  67%  censoring,  respectively.  This 
simulation  was  performed  on  a  Cyber  730  computer  at  the  Florida  State  University 
Computer  Center.  The  uniform  random  number  generator  used  was  the  intrinsic 
routine  RANF,  and  a  result  by  Lurie  and  Hartley  (1972)  concerning  the  sequential 
generation  of  ordered  samples  without  recourse  to  sorting  was  employed. 

Table  3  is  a  summary  of  the  asymptotic  tabular  values  [that  is,  in  computing 
the  tabular  values  we  used  the  known  values  of  8,  F(t)  and  G(t)]  utilized  in 

constructing  the  bands  for  the  different  values  of  8  and  o.  We  imposed  the 

0  ^  h  h  3.  a 

restriction  Xj^Xj  an<*  t*ie  tabular  values  Xj^  and  X 2  were  chosen  so 

the  asymptotic  widths  of  A^CtjX^X^)  at  t=0  and  t=l/2  were  approximately  equal. 

Tables  4a-c  are  summaries  of  the  observed  error  probabilities.  Based  on 

these  results,  the  following  rough  conclusions  can  be  made: 

(i)  When  the  censoring  proportion  is  small  (8=0.50),  the  asymptotic  error 

level  serves  as  an  acceptable  approximation  to  the  true  error  level  of  the  bands 

for  moderate  sample  sizes.  This  conclusion  is  supported  by  the  fact  that  the 

2  2  2 

observed  error  levels  of  the  asymptotically  exact  MLE-based  bands  An,  Bn  and  Cn, 

2 

and  the  PLE-based  band  H  are  close  to  a.  However,  as  8  increases  from  0.5  to 

n 

1.0  and  2.0,  the  observed  error  levels  for  n=50  are  far  from  a  although  they 
do  get  closer  to  a  when  n  is  increased.  Thus,  generally,  the  more  censoring 


there  is,  the  larger  the  sample  needed  for  the  asymptotic  levels  to  serve  as 

good  approximations  to  the  finite  sample  error  levels  of  the  bands. 

2  d  d  2  G  6  G 

(ii)  The  bands  Dn(t;A  ,z  )  and  D^(t;A  ,A  ,ze)  are  extremely  conservative, 

2  £  2  2  2  2  2 
while  F^(t;A  )  and  G^(t;Ag)  are  less  conservative  than  and  E^,  with  F^  tending 

2 

to  be  less  conservative  than  G  . 

n 

4.  Comparison  of  widths  of  the  bands.  In  this  section  we  present  analytic 

comparisons  of  the  asymptotic  widths  of  the  bands,  and  determine  regions  where 

each  tends  to  perform  well.  Let  o»n(t;«)  denote  the  width  of  the  bands  (1.2)  and 

1/2 

(2.10),  and  define  the  asymptotic  width  to  be  lim^ )an  ( t ; * ) .  To  distinguish 

which  pair  of  bands  is  referred  to,  let 

(4.1)  xe(t;*)=limnn1/2a)n(t;*)=2F(t)limnn1/2rn(t;»)  a.s., 

where  0  is  the  letter  label  of  the  pair  of  band,  e.g.,  the  asymp¬ 

totic  width  of  pair  An.  In  (4.1)  we  used  the  strong  consistency  of  Fn(t)  and 
Sn(t)  for  F(t) ,  and  the  fact  that  the  asymptotic  widths  of  the  bands  in  (1.1) 
and  (1.2),  and  in  (2.9)  and  (2.10)  are  equal.  Table  5  is  a  summary  of  the 
asymptotic  widths  of  the  different  pairs  of  bands,  and  the  conditions  that  their 
corresponding  tabular  values  must  satisfy.  The  asymptotic  width  of  the  PLE-based 
Hn~band  is  obtained  by  noting  that 

d^(t)=-/ot[SnCu)Hn(u)]"ldSn(u)^  yd(t)  a*s* 
when  the  KG  model  obtains. 

^  j  ^  G  G  G 

To  simplify  our  comparison  below  we  assume  that  A^=A2=A  »  *i=^2=*  and 

Under  the  first  assumption,  band  Cn  is  identical  to  band  An.  We 

now  compare  the  asymptotic  widths  of  the  bands  for  the  case  where  6  is  known. 

Since  d(T)2yd(T)2y2d(T)  it  follows  from  (2.2)  that  Aa>Ah>Ab.  This 

l  K  h 

immediately  implies  that  Xg(t;A  )sxH(t;A  ,A  )  for  OstsT,  and  the  difference 
in  asyurototic  width  is  2F(t)  { (Ab-Ab)  +  yd(t) (Xb-yAb) }  which  gets  large  for  large  t. 
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The  effect  on  this  difference  due  to  a  change  in  6  is  not  very  clear  since 

Xb  and  X*1  both  depend  on  8.  Nevertheless,  by  acting  as  if  Xb3Xb,  which  is 

indicated  by  Table  3,  the  maximum  difference  occurs  at  approximately  8=1  and 

decreases  as  8  becomes  different  from  1  in  either  direction.  On  the  other 

hand,  the  difference  in  asymptotic  widths  of  bands  Hn  and  ^  is 

2F(t) { (Xb-Xay)  +  yd(t) (Xb-Xa) },  and  since  Table  3  also  indicates  that  Xb*Xa=Xc, 

this  difference  is  positive  and  increases  with  either  an  increase  in  8  or  a 

decrease  in  t.  Thus,  both  MLE-based  bands  outperform  the  Gillespie-Fisher  PLE- 

based  band  in  terms  of  their  asymptotic  widths. 

It  is  a  simple  algebraic  exercise  to  show  that 

XA(t;Xa,Xa)^xB(t;Xb)  «=s>  tsH_1{Y(Xa-YXb)/[Xb(l-Y2)]}. 

Since  XasXb  it  follows  that  H"1{y(Xa-YXb)/[Xb(l-Y2)]}sH“1{Y/(l+Y)} = F_1{ [y/(1+y)]Y}. 

v 

Furthermore,  since  (y/Q+y)}  decreases  rapidly  from  1.0  to  0.5  as  8  increases  from 

0  to  1,  we  deduce  that  the  interval  {t:  Xa, Aa) ^xB Ct ; gets  shorter  as  8 

increases,  but  does  so  at  a  slow  rate.  Adapting  again  the  idea  of  a  mixed  band, 

we  can  define  the  band 

(AB)n(t;Xa,Xa,Xb)  =  An(t;Xa,Xa)  nBn(t;Xb), 

which  would  still  be  an  asymptotically  (l-a)100%  simultaneous  confidence  band  for 

F,  but  whose  finite  sample  coverage  probability  might  be  much  less  than  1-a. 

d.  © 

Similar  results  carry  over  to  the  case  where  8  is  unknown.  Setting  z  =z  , 
we  have 

XD(t;Xd,zd)2:xE(t;Xe,Xe,ze)  <t=>  tSH-1(Y(Xe-XdY)/[Xd(l-Y2)]>, 

and 

xF(t;Xf)2xG(t;Xg)  <*=*  tsH‘1{Y(Xg-XfY)/[Xf(l-Y2)J}. 

Thus  the  E  —  and  G  —bands  are  narrower  for  small  t,  while  the  D  —  and  F  -bands 
n  n  n  n 

are  narrower  for  large  t.  From  the  derivation  of  the  bands  in  Section  2  and 
the  simulation  results  of  Section  3  we  have  seen  that  the  Dn—  and  En— bands  are 
extremely  conservative.  Intuitively,  we  therefore  expect  them  to  be  wider  than 
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Table  5.  Asymptotic  Widths  of  the  Pairs  of  Bands  An  to  Hn  with 
Corresponding  Requirements  for  Their  Tabular  Values 


Band ,  6 

[2F(t)]-1X0 (t; •) 

Requirements  for 
Tabular  Values 

A 

Y[xf+X«d(t) 1 

WXt’X2)=1-° 

B 

Ab[l+Y2d(t)] 

Vd(T)(,b-lb>=1-“ 

C 

xcy/H(t) 

Qd(T)(xC'XC)=1“a 

D 

Ad[l+Y2dCt)]  +  2dB%|£n  F  (t)  | 

Vd(T  )^d.xd)[2$(zd)'1]  =  l-“ 

E 

Y[X®+X*d(t)]  +  ze8%|£n  F(t)l 

Qd(T)(xrx2)[2$(ze)"1]=1_a 

F 

Xf[l+Y2d(t)] 

Qj1^ (Xf)=l-a 

G 

XgY/H(t) 

Q^2)(Xg)=l-a 

H 

[Xb+XbYd(t)] 

— 

^ydm  (XrX2)  =  1-“ 
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the  less  conservative  bands  F„  and  G  .  We  show  this  below.  From  Table  5  it 

n  n 

is  easy  to  verify  that  Xg(t; Xe, Xef ze)^Xg(t; X&)  if  and  only  if  t  satisfies  the 

transcendental  equation 

(4.2)  H(t)H(t;)  Sexp{-  (Xg-X*)  /  (zV/2)  } . 

The  function  in  the  left  side  of  (4.2),  which  is  graphed  in  Figure  1,  has  a 
minimum  value  of  0.6922  at  H(t)=exp(-1) ,  and  drops  rapidly  from  1.0  and  slowly 
climbs  back  to  1.0  as  H(t)  varies  from  0  to  1.  This  is  the  reason  why  the 
interval  (t:xc(t;X  ,X  ,z  )^x^(tJ^6)^  is  wide.  For  example,  using  the  tabular 
values  in  Table  3  and  referring  to  Figure  1,  the  Gn- band  is  narrower  than  the 
En— band  in  the  approximate  intervals  (0.07,2.46),  (0.08,1.55)  and  (0.07,0.93) 

1  J  £ 

for  8=0.5,  1.0  and  2.0,  respectively.  Similarly,  xD(t;X,z  )^Xp(t;Xr)  whenever 
t  satisfies 

H(t)H(t)sexp{-(Xf-Xd)/[zd61/2(6+l)]>. 

Referring  again  to  Figure  1  and  using  the  tabular  values  in  Table  3,  we  find 
(0.05,2.71),  (0.03,2.13)  and  (0.02,1.64)  to  be  the  approximate  intervals  where 
band  Fn  is  narrower  than  band  Dn  for  8=0.5,  1.0  and  2.0,  respectively.  For  all 
practical  purposes,  we  can  therefore  consider  the  Fn-  and  Gn-  bands  to  be 
narrower  than  the  Dn—  and  En— bands  in  the  regions  where  they  perform  well.  As 
in  the  case  where  8  is  known,  we  could  also  define  the  mixed  band 
(FG)n(t;Xf,Xg)  =  Fn(t;Xf)  n  Gn(t;Xg) 

which  will  be  narrower  on  almost  the  whole  region  of  interest  as  compared  to  the 
other  bands  presented.  The  simulation  results  in  Section  3  indicate  that  band 
(FG)n  may  still  be  conservative. 

The  PLE-based  GF-type  H  -band  is  wider  than  the  F  -  and  G  -bands  whenever 

1  n  n  n 

t2H_1(n)  and  t<H_1(5),  respectively,  where  n={y(X^y-X}')  }/{  (1-y)  [Xh-X^(l+y)  ] } 

ah  h 

and  £*(y(X8-X  ) }/{X  (1-y) >.  To  get  some  idea  on  these  changeover  points,  notice 
that  except  for  the  case  8=0.5  and  a=0.10  in  Table  3,  H  ^(n)sH  (5).  Thus,  in 


Figure  2b.  Plots  of  90%  Confidence  Bands  for  Randomly- 
Generated  Data  Under  the  KG  Model  with  B=  1.0 
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Figure  2c.  Plots  of  90%  Confidence  Bands  for  Randomly 
Generated  Data  Under  the  KG  Model  with  6  =  1.5 


Figure  2d.  Plots  of  90%  Confidence  Bands  for  Randomly 
Generated  Data  Under  the  KG  Model  with  6  =  2.0 


roost  situations  we  expect  band  (FG)n  to  be  narrower  than  band  Hn  on  almost  the 
whole  region  of  interest. 

2 

In  Figures  2a-d  we  present  superimposed  plots  of  the  MLE-based  bands  D  , 

2  2  2  .  2 
E“,  Fn  and  Gn,  and  the  Gillespie-Fisher  PLE-based  band  Hn.  These  bands  are  asymp¬ 
totic  90%  simultaneous  confidence  bands  for  F  based  on  four  randomly  generated 
sets  of  censored  data  each  of  size  100.  The  MLE  and  the  PLE  of  F  are  also 
plotted.  In  plotting  the  bands  and  estimators  via  the  Versatec  plotter,  we 
have  taken  the  liberty  of  connecting  adjacent  points  by  straight  lines.  The 
four  data  sets  were  generated  via  the  KG  model  considered  in  the  simulation, 
with  the  censoring  parameter  g  taking  values  0.50,  1.00,  1.50  and  2.00.  In 
constructing  the  bands,  T  was  set  equal  to  1.0.  Notice  that  when  g  is  small 
the  differences  among  the  bands  are  not  very  apparent,  but  as  g  increases  the 
differences  among  the  bands  are  clear-cut.  These  plots  are  consistent  with 
our  analytical  results  showing  the  Fn-  and  Dn-bands  are  narrower  for  large  t, 
and  the  Gn~  and  En-bands  are  narrower  for  small  t.  The  small  interval  emanating 
from  0  where  band  En  is  narrower  than  band  Gn  is  also  visible  in  Figures  2a-d. 
Finally,  under  the  KG  model  it  is  seen  that  the  PLE-based  band  holds  its  own 
with  the  MLE-based  bands  when  the  censoring  proportion  is  small  (g=0.50),  but  as  g 
increases  it  is  easily  outperformed  by  the  MLE-based  bands. 
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